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Abstract 

It has been shown that a best rank-i? approximation of an order-fc tensor may not exist when 
R > 2 and k > 3. This poses a serious problem to data analysts using tensor decompositions. 
It has been observed numerically that, generally, this issue cannot be solved by consecutively 
computing and subtracting best rank-1 approximations. The reason for this is that subtracting 
a best rank-1 approximation generally does not decrease tensor rank. In this paper, we provide 
a mathematical treatment of this property for real-valued 2x2x2 tensors, with symmetric 
tensors as a special case. Regardless of the symmetry, we show that for generic 2x2x2 tensors 
(which have rank 2 or 3), subtracting a best rank-1 approximation results in a tensor that has 
rank 3 and lies on the boundary between the rank-2 and rank-3 sets. Hence, for a typical tensor 
of rank 2, subtracting a best rank-1 approximation increases the tensor rank. 
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1 Introduction 



Tensors of order d are defined on the outer product of d linear spaces, Si, 1 < i < d. Once 
bases of spaces Si are fixed, they can be represented by d-way arrays. For simphcity, tensors are 
usuahy assimilated with their array representation. We assume throughout the following notation: 
underscored bold uppercase for tensors e.g. X, bold uppercase for matrices e.g. T, bold lowercase 
for vectors e.g. a, calligraphic for sets e.g. S, and plain font for scalars e.g. Xj^^, Tij or Oj. In this 
paper, we consider only 3rd order tensors. The three spaces of a 3rd order tensor are also referred 
to as the three "modes". 

Let X be a 3rd order tensor defined on the tensor product Si <^ S2 <^ S3. If a change of bases is 
performed in the spaces Si,S2,S3 by invertible matrices S,T,U, then the tensor representation X 
is transformed into 

X = (S,T,U)-X, (1.1) 

whose coordinates are given by Xijk = '^pqr SipTjqUkr Xpq^. This is known as the multilinearity 
property enjoyed by tensors. Matrices, which can be associated with linear operators, are tensors 
of order 2. The multilinear transformation (II. ip is also denoted as 

X = X»i S.2T.3U, (1.2) 

where •£ denotes the multiplication (or contraction) operator in the ^th mode. Note that the matrix 
multiplication STU"^ can be denoted as T •! S '2 U = T "2 U •! S. For two contractions with 
matrices in the same mode, we have the rule X T S = X •£ (ST), see e.g. [H section 2]. 

The rank of a tensor X is defined as the smallest number of outer product tensors whose sum 
equals X, i.e. the smallest R such that 

R 

X = (K> (8> . (1-3) 

■r=l 

Hence a rank-1 tensor X is the outer product of vectors a, b, c and has entries Xjj^ = aibjCk- The 
decomposition of a tensor into a sum of outer products of vectors and the corresponding notion of 
tensor rank were first introduced and studied by [13] |15j . 

Tensors play a wider and wider role in numerous application areas including blind source sep- 
aration techniques for Telecommunications [27] [28] [6] [11] [8], Arithmetic Complexity [22] [35] 
[I] [33], or Data Analysis [13] [2] [29] [21]. In some applications, tensors may be symmetric only 
in some modes, or may not be symmetric nor have equal dimensions. In most applications, the 
decomposition of a tensor into a sum of rank-1 terms is relevant, since tensors entering the models 
to fit have a reduced rank. For example, such a tensor decomposition describes the basic structure 
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of fourth-order cumulants of multivariate data on which a lot of algebraic methods for Independent 
Component Analysis are based [3] [10] [16] . For an overview of applications of tensor decomposi- 
tions, refer to |19j . 

An important advantage of using tensor decompositions of order 3 and higher, is that the 
decomposition is rotationally unique under mild conditions |22j |33j . This is not the case for most 
matrix decompositions, e.g. Principal Component Analysis. However, the manipulation of tensors 
remains difficult, because of major differences between their properties when we go from second 
order to higher orders. We mention the following: (i) tensor rank often exceeds dimensions, (ii) 
tensor rank can be different over the real and complex fields, {in) maximal tensor rank is not 
generic, and is still unknown in general, (iv) generic tensor rank may not have a single value over 
the real field, (v) computing the rank of a tensor is very difficult, (vi) a tensor may not have a best 
rank-i? approximation for R > 2. For {i)-{v), see e.g. [53] [T^] [7j. For (iv), see e.g. [37] [SH]. For 
(vi), see e.g. [HD] [ST] [32] [12] [20] [M]- A discussion specifically focussed on symmetric tensors can 
be found in [7]. 

In [12] it is shown that (vi) holds on a set of positive measure. It is recalled in [7] and [12] 
that any tensor has a best rank-1 approximation. However, it has been observed numerically in 
[181 section 7] that a best or "good" rank-i? approximation cannot be obtained by consecutively 
computing and substr acting R best rank-1 approximations. The reason for this is that subtracting a 
best rank-1 approximation generally does not decrease tensor rank. Hence, the deflation technique 
practiced for matrices (via the Singular Value Decomposition) cannot generally be extended to 
higher-order tensors. A special case where this deflation technique works is when the tensor is 
diagonalizable by orthonormal multilinear transformation; see \18\ section 7]. 

In this paper, we provide a mathematical treatment of the (in)validity of a rank-1 deflation 
procedure for higher-order tensors. We consider 2x2x2 tensors over the real field. For such a 
tensor X, let the frontal slabs be denoted as Xi and X2. Our main result is for generic tensors 
X, which have rank 2 if X2X||~^ has distinct real eigenvalues, and rank 3 if X2X^^ has complex 
eigenvalues. We show that for generic Xj subtraction of a best rank-1 approximation Y yields 
a tensor Z = X — Y of rank 3. Hence, for a typical X of rank 3 this does not affect the rank, 
and for a typical X of rank 2 this has increased the rank. In fact, we show that Z lies on the 
boundary between the rank-2 and rank-3 sets, i.e. Z2Zj~^ has identical real eigenvalues. The result 
that subtraction of a best rank-1 approximation yields identical eigenvalues is new and expands the 
knowledge of the topology of tensor rank. Also, we show that the same result holds for symmetric 
2x2x2 tensors. Based on numerical experiments we conjecture that the results can be extended 
to p X p X 2 tensors over the real field. 
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The above contributions are new to the hterature on best rank-1 approximation of higher-order 
tensors. The latter includes best rank-1 approximation algorithms [9] [39] |18j . conditions under 
which the best rank-1 approximation is equal to the best symmetric rank-1 approximation [25], 
and a relation between the best symmetric rank-1 approximation and the notions of eigenvalues 
and eigenvectors of a symmetric tensor |5] |26j . 

This paper is organized as follows. In Section 2, we introduce the best rank-1 approximation 
problem for 3rd order tensors, and state first order conditions for the optimal solution. Next, we 
consider 2x2x2 tensors. Section 3 contains rank criteria and orbits for 2x2x2 tensors. In Section 
4, we present examples and general results for subtraction of a best rank-1 approximation from a 
2x2x2 tensor. In Section 5, 6 and 7, we discuss the special case of symmetric tensors. Section 
5 provides first order conditions for the best symmetric rank-1 approximation of a symmetric 3rd 
order tensor. Section 6 contains rank criteria and orbits of symmetric 2x2x2 tensors. These results 
are used in Section 7, when studying the subtraction of a best symmetric rank-1 approximation 
from a symmetric 2x2x2 tensor. Section 8 contains a discussion of our results. The proofs of 
our main results are contained in appendices. 



2 Best rank-1 approximation 

We consider the problem of finding a best rank-1 approximation to a given 3rd order tensor X G 

^diXd2Xd3 ^ 

minxGMdi,yGR'*2,zGM''3 | |X " X y z| | ^ , (2.1) 

where || • || denotes the Frobenius norm, i.e. ||X|P = X^jjfc l^ijfeP- Since the set of rank-1 tensors 
is closed, problem ()2.ip is guaranteed to have an optimal solution [12^ proposition 4.2]. Note that 
the vectors x, y, z of the rank-1 tensor (x ® y (8) z) are determined up to scaling. One could impose 
two of the vectors to be unit norm. 

The criterion of (j2.ip can be written as 

^ = ||X||2 -2X. 1X^.2 .32^ + ||x||2||y||2||z||2. (2.2) 

When the gradients of ^ with respect to the vectors x, y, z are set to zero, we obtain the following 
equations: 

_ x>2 '3 _ X >i x^ >3 z^ _ X >i x^ >2 „x 

~ II II9II 119 ' y~ II II9II 119 ' ^~ II II9II 119 • [^■'^) 

||y|| ||z|| ||x||^||z||^ ||x|p||y||^ 

Substituting 

X •9 y "3 z , , 

^ = ~|l II2II 112 ' 2.4 

l|y|| ll^ll 
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into the last two equations of ()2.3p . we obtain 

(X -3 z'^) •! (X»3 z^) '2 y'^ = Ay , (X '2 y'^) •! (X '2 y'^) 's z'^ = /tiz, (2.5) 

where A = | |x| p| |y| p| |z| |^ and fi = | |x| p| |y| |^| |z| p. Hence, y is an eigenvector of the matrix 
(X "3 z^) •! (X "3 Z"^) and z is an eigenvector of the matrix (X "2 y"^) 'i (X "2 y"^)- 
Substituting (|2.4p into the criterion (j2.2p yields 

,,9 (X "2 y"^ '3 Z"^) •! (X "2 y"^ '3 z"^) ,, ,,9 I |X "2 y"^ '3 z"^! P 

^ = X _ ^— ^ II 19!, 779 ^ = X - ,, H>>ii M>> ■ 2.6) 

I |y| I I |z| I I |y| I I |z| 1^ 

Hence, a best rank-1 approximation (x ® y ® z) of X is found by minimizing (12. 6|) over (y, z) and 
obtaining x as (12. 4p . The stationary points (y, z) are given by ()2.5p . which can also be written as 



iv . . „T||2 

(X»2y -sz 'UX'sz ) = n-Ti2 y^-^i 

|yr 



IIY - -,T - „T||2 

/V T T\ /V T\ A»2y Z , , 

(X»2 y '3 Z^ ) •! (X -2 y ) = rr-no Z. (2.8) 

I \z\ \ 

Next, we consider transformations of the best rank-1 approximation. The following well-known 
result states that a best rank-1 approximation is preserved under orthonornial multilinear trans- 
formation. 

Lemma 2.1 Let S,T,U he orthonornial matrices. If a tensor admits Y as a best rank-1 ap- 
proximation, then (S,T,U) • Y is a best rank-1 approximation o/(S,T,U) • X- 

Proof. Let Y = x(8)y(8)zbea best rank-1 approximation of X, and let X = (S, T, U) • X. Since 
orthonormal transforms leave the Frobenius norm invariant, we obtain the following analogue of 



|X-x(g)y0z|p = ||X|p - 2X •! x'^ -2 y^ '3 z'^ + ||x|p||y|p||z|p 



|X|P - 2X •! (x^S) .2 (y^T) .3 (z^U) + ||S^i||2||T^y||2||U^z||2 . (2.9) 



Hence, since (x,y, z) is a minimizer of (12. 2p . it follows that (Sx, Ty,Uz) is a minimizer of ()2.9p . 
In other words, a best rank-1 approximation of X is given by (Sx (S> Ty Uz). This completes the 
proof. □ 

As we will see later, most tensors have multiple locally best rank-1 approximations, with one of 
them being better than the others (i.e., a unique global best rank-1 approximation). Our final 
result in this section states a condition under which there exist infinitely many best (global) rank-1 
approximations. 
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Proposition 2.2 Let X be such that the matrix (X "a z^) is orthogonal for any vector z, and 



(X '2 y ) is orthogonal for any vector y. Then X has infinitely many best rank-1 approximations. 

Proof. The proof follows from equation p.Sp for the stationary points (y,z). The conditions of 
the proposition imply that the matrices (X "3 z^) •! (X "3 z^) and (X "2 y"^) 'i (X "2 y"^) are 
proportional to the identity matrix for any y and z. Therefore, any vector is an eigenvector of 
these matrices, and (I2.5p holds for any (y,z). 

Since any (y, z) is a stationary point of criterion ()2.6p . it follows that the latter is constant. We 
conclude that any (x ® y ® z) with x as in (j2.4p . is a best rank-1 approximation of X- D 



Below is a 2 X 2 X 2 example satisfying the conditions of Proposition 
with two slabs Xi and X2 as [Xi | X2]. 



We denote a tensor X 



Example 2.3 Let 



X 



1 


-1 


1 


1 



(2.10) 



Then for any choice of nonzero vector z, the matrix (X z^), obtained by linear combination of 
the above two matrix slices, is orthogonal. Also, for any nonzero vector y, the matrix (X '2 y"^) is 
orthogonal. Hence, X has infinitely many rank-1 approximations. One can verify that each rank-1 
approximation (x y z) with x as in ()2.4p . satsifies | |X — x (8) y (8) z| p = 3. □ 



The tensor ()2.10p has rank 3 and is studied in [36] where it is shown that it has no best rank-2 
approximation, the infimum of ||X ~ Y|p over Y of rank at most 2 being 1. A more general 
result is obtained in [T^ where it is shown that no 2 x 2 x 2 tensor of rank 3 has a best rank-2 
approximation. In [30] it is shown that any sequence of rank-2 approximations Y^"^ for which 
I | X — Y^^^lp converges to the infimum of 1, features diverging components. 



3 Rank criteria and orbits of 2 x 2 x 2 tensors 

It was shown in [121 section 7] that 2x2x2 tensors (over the real field) can be transformed by 
invertible multilinear multiplications (jl.ip into eight distinct canonical forms. This partitions the 
space m2x2x2 gigj^t distinct orbits under the action of invertible transformations of a tensor 
"from the three sides" . 

Before the eight orbits are introduced, we define some concepts. A mode-n vector of a di x ^2 x ^3 
tensor is an (i„ x 1 vector obtained from the tensor by varying the n-th index and keeping the other 
indices fixed. The mode-n rank is defined as the dimension of the subspace spanned by the mode-n 
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vectors of the tensor. The multilinear rank of the tensor is the triplet (mode-1 rank, mode-2 rank, 
mode-3 rank). The mode-n rank generahzes the row and column rank of matrices. Note that a 
tensor with multilinear rank (1, 1, 1) has rank 1 and vice versa. The multilinear rank is invariant 
under invertible multilinear transformation [12|, section 2]. 

Related to the orbits of 2 x 2 x 2 tensors is the hyperdeterminant. Slab operations on [Xi | X2] 
generate new slabs of the form Ai Xi + A2 X2. There holds 

det(Ai Xi + A2 X2) = Xj det(Xi) + X, X, det(Xi + X^) - det(Xi - X^) ^ ^^^^^^^ ^^^^^ 

The hyperdeterminant of X) denoted as A(X), is defined as the discriminant of the quadratic 
polynomial ()3.ip : 

2 



AfX) 



det(Xi + X2) - det(Xi - X2) 



4det(Xi)det(X2). (3.2) 



2 

Hence, if A(X) is nonnegative, then a real slabmix exists that is singular. If A(X) is positive, 
then two real and linearly independent singular slabmixes exist. It follows from ()3.ip - (l3.2p that the 
hyperdeterminant is equal to the discriminant of the characteristic polynomial of det(Xi)X2Xj~^ 
or det(X2)XiX2^ The sign of the hyperdeterminant is invariant under invertible multilinear 
transformation \\.2\ section 5]. 

Table [1] lists the canonical forms for each orbit as well as their rank, multilinear rank and 
hyperdeterminant sign. Generic 2x2x2 tensors have rank 2 or 3 over the real field, both on a set 
of positive measure [23] [37] . 

For later use, we state the following rank and orbit criteria. The rank criteria have been proven 
for p X p X 2 tensors in [T7] . The 2x2x2 orbits can be found in [121 section 7] . In the sequel, we 
will use this result to verify the orbit of a 2 x 2 x 2 tensor. 

Lemma 3.1 Let X&ea2x2x2 tensor with slabs Xi and X2. 

(i) If J^2^i^ or X1X2 ^ has real eigenvalues and is diagonalizable, then X is in orbit G2. 

(ii) If X2X^^ or XiX2^"'^ has two identical real eigenvalues with only one associated eigenvector, 
then X is in orbit D3 . 

(Hi) //X2XJ^^ or X1X2 ^ has complex eigenvalues, then'K is in orbit G3. 

□ 
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canonical form 



tensor rank multilinear rank sign A 



• 



D' 



n" 



Go 



: 



G, : 



























1 























1 














1 








1 








1 














1 

















1 





1 




















1 





1 


1 





1 












-1 


1 


1 


1 



(0,0,0) 

(1,1,1) 

(2,2,1) 
(1,2,2) 
(2,1,2) 
(2,2,2) 
(2,2,2) 
(2,2,2) 



+ 



Table 1: Orbits of 2 x 2 x 2 tensors under the action of invertible multilinear transformation 
(S, T, U) over the real field. The letters D and G stand for "degenerate" (zero volume set in the 
8-dimensional space of 2 x 2 x 2 tensors) and "typical" (positive volume set), respectively. 
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4 Best rank-1 subtraction for 2x2x2 tensors 



For 2x2x2 tensors X in the orbits of Table [H we would like to know in which orbit X — Y is 
contained, where Y is a best rank-1 approximation of X. In this section, we present both examples 
and general results. We begin by formulating our main result. It is not a deterministic result, but 
involves generic 2x2x2 tensors, which are in orbits G2 and G3. Any tensor randomly generated 
from a continuous distribution can be considered to be typical. The full proof of Theorem 14.11 is 
contained in Appendix A. 

Theorem 4.1 Let ^ be a generic 2x2x2 tensor, and let Y_ he a best rank-1 approximation of 
X . Then almost all tensors X ~ X c^^e ifT- orbit D3. 

Proof sketch. We proceed as in the first part of Section 2. We show that there are eight station- 
ary points (y,z) satisfying (j2.7p - (j2.8|) . and that these can be obtained as roots of an 8th degree 
polynomial. There are two stationary points that yield x = in (|2.4|) . and do not correspond to 
the minimum of criterion (j2.2|) . For the other six stationary points, we have A(X — Y) = 0, where 
Y is the corresponding rank-1 tensor. Finally, we show that the multilinear rank of X — Y equals 
(2, 2, 2) for these six rank-1 tensors Y. Hence, it follows that the best rank-1 approximation Y 
satisfies A(X — Y) = and that the multilinear rank of X — Y is equal to (2, 2, 2). From Table [T] 
it then follows that X — Y is in orbit . □ 

Hence, for typical tensors in orbit G2, subtracting a best rank-1 approximation increases the rank 
to 3. For typical tensors in orbit G3, subtracting a best rank-1 approximation does not affect the 
rank. This is completely different from matrix analysis. 

In the proof of Theorem 14.11 in Appendix A, it is shown that the slabs of Z = X — Y are non- 
singular almost everywhere. From Lemma |3. II it follows that Z2'Z^^ has identical real eigenvalues 
and is not diagonalizable, while X2X]^^ has either distinct real eigenvalues or complex eigenvalues. 
Hence, the subtraction of a best rank-1 approximation yields identical real eigenvalues. 

Next, we consider X in other orbits, and present deterministic results. We have the following 
result for the degenerate orbits of ranks 1 and 2. 

Proposition 4.2 Let X&ea2x2x2 tensor, and let Y_be a best rank-1 approximation of X. 

(i) //X is in orbit Di, then J^ — 'Y_ is in orbit Dq. 

(ii) //X is in orbit D2, D'2, or D2, then ^ — 'Y_ is in orbit Di. 

Proof. For X in orbit Di it is obvious that Y = X is the unique best rank-1 approximation. Then 
X — Y is in orbit Dq. 



9 



Next, let X be in orbit D2. Then there exist orthonormal S, T, U such that 

(S,T,U)-X 



A 























(4.1) 



see [121 proof of lemma 8.2]. Subtracting a best rank-1 approximation from this tensor results in 
A or /i being set to zero (whichever has the largest absolute value; for X = fi there are two best 
rank-1 approximations). Hence, the result is a rank-1 tensor. From Lemma |2. II it follows that the 
same is true for subtracting a best rank-1 approximation from X- For X in orbits D2 and D2 the 
proof is analogous. □ 

For X in orbit G2 or D3, the tensor X — Y is not restricted to a single orbit. The following examples 
illustrate this fact. 



Example 4.3 Let 



X 



nonzero entry of Y is either Yin or 5^222)- 
On the other hand, consider 

X = 



1 










1 




Table [H It can be seen that X ~ Y is in Di 


1 


1 




1 


2 





(4.2) 



(4.3) 



For this tensor, X2X]^ ^ has two distinct real eigenvalues. Hence, by Lemma |3.1|, the tensor is in 
orbit G2. It can be shown that X has a unique best rank-1 approximation Y and that 



X- Y 



1 


1 


1 






(4.4) 



which is the canonical tensor of orbit D3 in Table [TJ 



□ 



Example 4.4 It follows from Lemma l3. II that the following tensors are in orbit Dy. 



2 


1 




1 


1 




1 


2 





1 







2 







1 



Subtracting the best rank-1 approximation Y from these tensors amounts to replacing the element 
2 by zero. Hence, X — Y is in orbit D2, D'2, and respectively. □ 
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On the other hand, it can be verified numerically or analytically that for X equal to the canonical 
tensor of orbit D3 in Table [H we have X. — Y also in orbit D3. Moreover, numerical experiments 
show that for a generic X in orbit D^, we have X — Y in orbit D3 as well. This suggests the 
following. 

Conjecture 4.5 IfX^ is in orbit and'Y_ is a best rank-1 approximation ofJ^, then almost all 
tensors X ~ Y are in D3. □ 



The tensor X in Example 1 2. 3 1 is in orbit G3 by Lemma f3. II It can be shown that any of the infinitely 
many best rank-1 approximations of X yields X~ Y in orbit D^, (proof available on request). The 
example below yields the same result for another X in orbit G3. Numerically and analytically, we 
have not found any X in orbit G3 for which X ~ Y is not in orbit D^. 



Example 4.6 Let 



X 



1 


-2 


1 


1 



(4.6) 



Since X2X^ ^ has complex eigenvalues, X is in orbit G3 by Lemma |3. II It can be verified that X 
has a unique best rank-1 approximation such that 



X- Y 



1 











1 


1 



(4.7) 



The latter tensor can be transformed to the canonical form of orbit D3 by swapping rows within 
each slab (i.e., by applying a permutation in the first mode). □ 



Our next result concerns tensors with diagonal slabs, i.e. 



X 



a 


e 





d 





h 



Then X has rank at most 2, since 



X 



1 





a 
e 



+ 




1 



(4.^ 



d 
h 



(4.9) 



Also, if ah 7^ de, then X = (I2,I2,U) • X, where X is the canonical tensor of orbit G2 in Table [H 
and 

a d 
e h 



U 



(4.10) 
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Hence, in this case X is in orbit G2. 

We show that, for 2x2x2 tensors with diagonal slabs, we have X — Y in orbit Di. Naturally, 
the same holds for X that can be transformed to a tensor with diagonal slabs by orthonormal 
multilinear transformation (see Lemma I2.ip . Note that tensors with diagonal slabs in orbit G2 
form an exception to the result of Theorem 14.11 as does the canonical tensor of orbit G2 (see 
Example 14. 3p . However, Theorem 14 . 1 1 st ates that these exceptions form a set of measure zero. 

Proposition 4.7 Let X6ea2x2x2 tensor with diagonal slabs and rank 2, and let Y_ be a best 
rank-1 approximation ofK. Then 'K — ^ is in orbit Di. 

Proof. We use the first part of Section 2. Let X be as in ()4.8p . First, we assume + < + /i^. 
For 



Y= , (4.11) 

d h 

we have ||X — Y|p = + e'^ and X ~ Y in orbit Di. Next, we show that ()4.1ip is the unique best 
rank-1 approximation of X- Using ()2.6p . the equation ||X — Y|p < + can be written as 

{d^ + h^) (yl + yl) {z\ + zl) < {ayizi + eyizs)^ + {dy2Zi + hy2Z2f , (4.12) 

which can be rewritten as 

(d^ + h^-a^- e^) yl {zf + z|) + {ezi - az2f yf + {hzi - dz2f yl<0. (4.13) 

Since {d^ + — — e^) is positive by assumption, and y nor z can be all-zero, it follows that 
()4.13p can only hold with equality, that is, for yi = and hzi = dz2. Using ()2.4p . it then follows 
that the Y for which we have equality in ()4.13p is given by ()4.1ip . This shows that ()4.1ip is the 
unique best rank-1 approximation of X- 

Next, we consider the case -\- e^ > d'^ + . Analogous to the first part of the proof, it can be 
shown that 

n (\ e (\ 

(4.14) 

is the unique best rank-1 approximation of X- This implies that ||X ~ Yp = -|- /i^ and X — Y 
is in orbit Di. 

Finally, we consider the case + e^ = dP + h"^ . Here, we have multiple best rank-1 approx- 
imations. Setting yi = in ()4.13p yields ()4.1ip as a best rank-1 approximation. Setting ^2 = 
yields (|4.14p as a best rank-1 approximation. If ah = de, then (I4.13P can also be satisfied by setting 



a 


e 
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ezi = az2 and hzi = dz2- This yields 



Via 2/12/20 
yiy2d yld 



vie 



yiy2e 

,2 1 



(4.15) 



It can be verified that for Y in (|4.15|) we have ||X 
Di. This completes the proof. 



ymh y^h 

Y|p = d'^ + h? = a? + and X - Y in orbit 

□ 



5 Best rank-1 approximation for symmetric tensors 

Here, we consider the best rank-1 approximation problem for a 3rd order tensor X E '^dxd-xd ^Yiq^ 
is symmetric in all modes, i.e. Xijk = Xjik = X^ji = Xikj = Xjki = X^ij. We assume the same for 
the rank-1 approximation, which yields the problem 

minygB,<* 1 12£ - y 1^ yi^ y| 1^ • (5.1) 

An adaption of \12\ proposition 4.2] shows that problem ()5.ip always has an optimal solution. 
Analogous to the first part of Section 2, the criterion of (15. ip can be written as 



*2 = I |X| I' - 2 X -1 y'^ -2 y"^ -3 y^ + I |y 1 1' • (5.2) 



When the gradient of ^'2 with respect to y is set to zero, we obtain 

XT T 
_•! y '2 y 



\y\\' 



(5.3) 



Substituting (j5.3p into the criterion (j5.2p yields 



^2 = ||X||^- "-'^^^^'/^ " ■ (5.4) 

Hence, a best symmetric rank-1 approximation (y (8) y y) of X is found by minimizing (|5.2p or 
(|5.4p over y. The stationary points y are given by (|5.3p : this was already noticed in [SJ section 2.3]. 



6 Rank criteria and orbits of symmetric 2x2x2 tensors 

Here, we consider real symmetric 2x2x2 tensors. We establish their ranks and orbits under 
invertible multilinear transformation (S,S,S) • X. These transformations preserve the symmetry. 
The symmetric tensor rank [TJ section 4] is defined as the smallest R such that 

R 

X = Sir ® SLr ® Sbr . (6.1) 



r=l 
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There is a bijection between symmetric d x d x d tensors and homogeneous polynomials of degree 
3 in d variables. A symmetric d x d x d tensor X is associated with the polynomial 



p{ui, . . . , Urf) = ^ Xijk UiUjUk 
ijk 



(6.2) 



A multilinear transformation (S, S, S) • X is equivalent to a change of variables v = Su in the 
associated polynomial. 

The symmetric rank of symmetric 2 x 2 x • • • x 2 tensors can be obtained from the well-known 
Sylvester Theorem, which makes use of the polynomial representation [24:\ section 5] [1]. For generic 
symmetric 2 x 2 x • • • x 2 tensors, d] show that the Sylvester Theorem defines an algorithm to 
compute a symmetric decomposition (j6.ip with R equal to the symmetric rank. Below, the Sylvester 
Theorem for symmetric 2x2x2 tensors is formulated. 

Theorem 6.1 (Sylvester) A real symmetric 2x2x2 tensor with associated polynomial 

p{ui,U2) = 73 nf + 3 72tifu2 + 371 niUg + 70^1 , (6.3) 

has a symmetric decomposition <\6.1\ into R rank-1 terms if and only if there exists a vector g = 
{go,-- -,9Rf with 

70 ■ ■ ■ IR 



71 



(6.4) 



IZ-R -■■ 73 

and if the polynomial q{ui, U2) = Qr + • ■ ■ + Qi uiU2~^ + go has R distinct real roots. □ 



For our purposes, we make use of a symmetric rank criterion similar to Lemma 13.11 formulated 
as Lemma 16.21 below. The link between this rank criterion and the Sylvester Theorem will be 
explained at the end of this section. 

Let the entries of a symmetric 2x2x2 tensor be denoted as 



X 



a 


b 


b 


c 


b 


c 


c 


d 



(6.5) 



For later use, we mention that the hyperdeterminant (j3.2p of X in (j6.5p is given by 

A(X) = {be - adf - 4{bd - c'^){ac - b^) . (6.6) 

As in the asymmetric case, the sign of the hyperdeterminant is invariant under invertible multilinear 
transformation (S,S,S) • X. 
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Lemma 6.2 Let be a real symmetric 2x2x2 tensor with slabs Xi and X2. 

(i) If ^2^1^ or X1X2 ^ has distinct real eigenvalues, then X has symmetric rank 2. 
(ii) //X2Xj^^ or X1X2 ^ has identical real eigenvalues, then X has symmetric rank at least 3. 
(Hi) //X2X^^ or J^i'K2^ has complex eigenvalues, then'K^ has symmetric rank at least 3. 

Proof. First, we prove (i). We consider X2X]^^. The proof for XiXg ^ is completely analogous. 
Note that since Xi is nonsingular, the symmetric rank of X is at least 2. Let X2Xj~"^ have distinct 
real eigenvalues Ai and A2. Using ()6.5|) . we have 



X2X^ 



1 

X y 



bd 



ac 



62 ' 



ad — be 
ac — b"^ 



(6.7) 



which has eigenvalues 



Ai 



y + A/y2 + 4x 



y - + 



(6.8) 



It can be verified that the eigenvectors of X2Xj~^ are a (1 Ai)'^ and jS (1 A2)"^, respectively. Next, 
we show that appropriate choices of a and (3 yield a symmetric rank-2 decomposition (|6.1|) for X. 
Let A contain the eigenvectors of X2Xj~^, i.e. 



a (3 
aAi /3A2 



(6.9) 



Note that a symmetric rank-2 decomposition (|6.1|) can be denoted as X^ = AA^ A^, with A^ = 
diag(afci, 0^2)) k = 1,2. The eigendecomposition of X2X^^ is then given by A A2Aj^^ A~^, which 
is consistent with ()6.9p since A.2A.'^^ = diag(Ai, A2). To obtain a symmetric rank-2 decomposition, 
it remains to solve the equations 



A-^Xi = Ai A^ 



13^ I3^\2 



A"^ X2 = A2 A'^ 



^2 



P^X2 0^X1 



(6.10) 



By writing out the entries of A -"^Xi and A "'^X2, it can be seen that a solution (a, /3) exists if and 
only if 

aAiA2-6(Ai + A2) + c = 0, 6A1 A2 - c(Ai + A2) + d = 0. (6.11) 

Using (|6.8p . we have A1A2 = —x and Ai -|- A2 = y- Combined with the expressions for x and y in 
(|6.7p . this verifies (j6.1ip . This completes the proof of (z). 

As shown in the proof of (i), if X has one slab nonsingular, say Xi, and symmetric rank 2, 
then X2X]^^ is diagonalizable. As also shown in the proof of (i), when X2X^^ has identical real 
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canonical form 



symmetric rank multilinear rank sign A 



(0,0,0) 

(1.1.1) 

(2.2.2) + 
(2,2,2) 
(2,2,2) 



Table 2: Orbits of symmetric 2x2x2 tensors under the action of invertible multilinear trans- 
formation (S, S, S) over the real field. The letters D and G stand for "degenerate" (zero volume 
set in the 4-dimensional space of symmetric 2x2x2 tensors) and "typical" (positive volume set), 
respectively. 

eigenvalues it does not have two linearly independent eigenvectors and, hence, is not diagonalizable. 
Therefore, in case (u) the symmetric rank of X is at least 3. The same holds in case {Hi). This 
completes the proof. □ 

Proposition 6.3 The orbits of real symmetric 2x2x2 tensors X under the action of invertible 
multilinear transformation (S, S, S) • X, are as given in Table\^ 

Proof. Let ei and 62 denote the first and second column of I2. Orbit Dq corresponds to the all-zero 
tensor. For X with symmetric rank 1, we have X = a a "X" a. There exists a nonsingular S with 
Sa = ei. Then (S, S, S) • X equals the canonical form of orbit Di in Table[2l which has multilinear 
rank (1,1,1). Analogously, for X with symmetric rank 2, we have X = ai 18) ai ai -|- a? , 
with ai and a2 linearly independent. There exists a nonsingular S with S[ai a2] = [ei 02]. Then 
(S, S, S) • X equals the canonical form of orbit G2 in Table [21 which has positive hyperdeterminant 
()6.6p and multilinear rank (2, 2, 2). 



Di : 



Go : 



























1 























1 




















1 





1 


1 





1 












G. 



-1 


1 


1 


1 
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Next, let X have symmetric rank 3 and decomposition ()6.ip with A = [ai a2 a^]. No two 
columns of A are proportional, since otherwise a symmetric rank-2 decomposition is possible. It 
follows that there exists a nonsingular S with 



a 1 



SA 



(6.12) 



f3 1 



This yields 



a 1 



1 1 



(S,S,S).X 



(6.13) 



1 1 



1 d 



with a = 1 + and d = 1 + We define orbits and G3 according to whether the hyper de- 
terminant A is zero or negative, respectively. Note that A > is associated with orbit G2. 

In Appendix C, we show that any pair of tensor X in orbit D3 or G3 is related to the canonical 
form Y of the orbit by an invertible multilinear transformation (S, S, S) • Y = X. 

We conclude our proof by showing that the symmetric rank of real symmetric 2x2x2 tensors 
is at most 3. Let X be as in (16. 5p . Suppose 6 7^ and c 7^ 0. Then (S, S, S) • X is of the form 
()6.13p for S = diag(/u,r7) with fi^ = c/lP' and rf = h/c?. Since ()6.13p has the symmetric rank-3 
decomposition (I6.12|) . the tensor X has at most symmetric rank 3. 

Next, suppose 6 = and c 7^ 0. We subtract a symmetric rank-1 tensor a (8) a (8) a with a = 7 ei 
from X such that (a — 7^)c > 0. Denote the resulting tensor by Z. It can be verified that Z2Zj"^ 
has distinct real eigenvalues. By Lemma 16.21 [i) it has symmetric rank 2. Combined with the 
subtracted rank-1 tensor, this implies a symmetric rank-3 decomposition of X- 

The case c = and 6 7^ can be dealt with analogously. When 6 = c = 0, a symmetric rank-2 
decomposition is immediate. Hence, the symmetric rank is at most 3. □ 

The following corollary follows from Lemma 16.21 and Proposition 16.31 It is the full analogue of 
Lemma 13. II 

Corollary 6.4 Let 'Khe a real symmetric 2x2x2 tensor with slabs Xi and X2. 
(i) //X2X^^ or X1X2 ^ has distinct real eigenvalues, then'K is in orbit G2. 
(ii) If ^2^1^ or X1X2 ^ has identical real eigenvalues, then X is in orbit D^. 
(Hi) If J^2^i^ or X1X2 ^ has complex eigenvalues, then X is in orbit G3. 

Proof. Since there is only one orbit with symmetric rank 2, the proof of (i) is the proof of 
Lemma 16.21 (i). Since the symmetric rank is at most 3, cases (ii) and {Hi) have symmetric rank 3. 
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As in the asymmetric case, the hyper determinant (j6.6p is equal to the discriminant of the character- 
istic polynomial of det(Xi)X2Xj^^ or det(X2)XiX2^^. Hence, case (ii) has A = and corresponds 
to orbit Ds, and case {in) has A < and corresponds to orbit G3. □ 

The hyperdeterminant (16. 6p is equal to the discriminant of the polynomial q{ui,U2) in the Sylvester 
Theorem (Theorem 16. ID for X in (16. 5p and R = 2. Indeed, we have 73 = a, 72 = 6, 71 = c and 
7o = d. The vector g = {go, gi, 92)'^ should satisfy 

deb 
c b a 

which implies 

go = ac — b'^ , gi = be — ad , g2 = bd — (? . (6.15) 

The discriminant of q{ui,U2) is given by g\ — Ago g2 which is equal to the hyperdeterminant ()6.6p . 
This establishes the equivalence between the Sylvester Theorem with R = 2 and the symmetric 
rank criteria of Lemma 16. 2[ 

7 Best rank-1 subtraction for symmetric 2x2x2 tensors 

Here, we consider the problem of determining the rank and orbit of X — Y, where X is a symmetric 
2x2x2 tensor and Y is a best symmetric rank-1 approximation of X. Obviously, if X is in orbit 
Di, then X — Y is in orbit Do- Next, we present our main result in Theorem 17. H which is the 
analogue of Theorem 14. 1[ It concerns generic symmetric 2x2x2 tensors, which are in orbits G2 
and G3. The full proof of Theorem 17.11 is contained in Appendix B. 

Theorem 7.1 Let J^be a generie symmetrie 2x2x2 tensor, and let Y_be a best symmetric rank-1 
approximation ofX^. Then almost all tensors X — Y are in orbit D3. 

Proof sketch. We proceed as in Section 5. It is shown in [HI section 3.5] that there are three 
stationary points y satisfying (|5.3p . and that these can be obtained as roots of a 3rd degree poly- 
nomial. We show that, for all three stationary points, we have A(X — Y) = 0, where Y is the 
corresponding rank-1 tensor. Finally, we show that the multilinear rank of X — Y equals (2, 2, 2) 
for these three rank-1 tensors Y. This suffices to conclude that X ~ X is in orbit D3. □ 

Hence, as in the asymmetric 2x2x2 case, for typical symmetric tensors in orbit G2, subtracting 
a best symmetric rank-1 approximation increases the symmetric rank to 3. For typical symmet- 
ric tensors in orbit 6*3, subtracting a best symmetric rank-1 approximation does not affect the 
symmetric rank. 



g = 0, 



(6.14) 
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In the proof of Theorem 17.11 in Appendix B, it is shown that the slabs of Z = X — Y are non- 
singular almost everywhere. Prom Lemma 16.21 it follows that Z2ZJ^^ has identical real eigenvalues, 
while X2Xj^^ has either distinct real eigenvalues or complex eigenvalues. Hence, also for symmetric 
2x2x2 tensors, the subtraction of a best rank-1 approximation yields identical real eigenvalues. 

We conclude this section with examples of X in orbits G2 and G3 such that X — Y is in orbit 

^3. 



Example 7.2 Let 



X 






1 


1 


1 


1 


1 


1 






X2X[ 



-1 



1 

-1 1 



(7.1) 



Since the latter has complex eigenvalues. Lemma 16.21 shows that X is in orbit G3. 

Next, we compute the best symmetric rank-1 approximation Y to Xj which has the form 



Vi 2/12/2 
2/i2/2 yivl 



yh^ yivl 
yivl yl 



(7.2) 



The stationary points ()5.3p are given by 



6 + 12 yivl + 6 yiy^ - 12 yiya - 6 yi 



6 yl + 12 ylyl + 6 y\y2 -12ym-6 yl 



0: 

0. 



(7.3) 
(7.4) 



It follows that yi 7^ and y2 7^ (if one of them equals zero, then both are zero and Y is all-zero). 
Muliplying ()7.4p by yi and subtracting ()7.3p multiplied by y2 yields 



6 (?/2 - yi) I 1/2 ^ t ^ 2/1)12/2 



-3- 



Vi 



0. 



(7.5) 



Hence, y2 = yi or y2 = (—3/2 it -v/5/2)yi. However, it can be verified that the latter is in 
contradiction with (f7l3l)-(f7^. Therefore, y2 = yi and it follows from (fT^ - fTil) that yf = y^ = 3/4. 
This yields ^ = 3/2 and 



1 

4 



3 3 
3 3 



3 3 
3 3 



Z = X - Y 



1 


-3 1 


1 1 


4 


1 1 


1 -3 



Hence, 

1 

-1 -2 

which has a double eigenvalue —1. Hence, Z is in orbit D3 by Lemma 16.21 



Z2Z-^ 



(7.6) 



(7.7) 



□ 
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Example 7.3 Let 



X 



3 


1 


1 


1 


1 


1 


1 


3 



1 

-1 4 



(7.8) 



Since the latter has real and distinct eigenvalues, Lemma 16.21 shows that x is in orbit G2. 

Analogous to Example \7.2\ it can be shown that the best symmetric rank-1 approximation of 
X is given by 

3 



1 


1 


1 


1 


1 


1 


1 


1 



We obtain 



Z = X- Y 



3 -1 


-1 -1 


-1 -1 


-1 3 



ZoZ 



-1 



2^1 



1 

-1 -2 



(7.9) 



(7.10) 



The latter has a double eigenvalue —1. Hence, Z is in orbit D3 by Lemma 



□ 



8 Discussion 

It is now rather well known that consecutively subtracting a best rank-1 approximation from 
a higher-order tensor generally does not either reveal tensor rank nor yield a "good" low-rank 
approximation. A numerical example and discussion is provided in [181 section 7]. Hence, a rank-1 
deflation procedure as is available for matrices, generally does not exist for higher-order tensors. 
We have given a mathematical treatment of this property for real 2x2x2 tensors. In Theorem l4.H 
we showed that subtracting a best rank-1 approximation from a generic 2x2x2 tensor (which 
has rank 2 or 3) results in a rank-3 tensor located on the boundary between the sets of rank-2 
and rank-3 tensors. Hence, for typical tensors of rank 2, subtracting a best rank-1 approximation 
increases the rank to 3. 

A generic 2x2x2 tensor X has rank 2 if X2X^^ has distinct real eigenvalues, and rank 3 if 
X2X^^ has complex eigenvalues; see Lemma |3. 11 If Y is a best rank-1 approximation of X, then 
Z = X — Y has rank 3 and lies on the boundary between the rank-2 and rank-3 sets, i.e. T^2^\^ has 
identical real eigenvalues. The rank-2 and rank-3 orbits G2 and G3 are characterized by positive 
and negative hyperdeterminant A, respectively, while on the boundary we have A = 0. The result 
that subtraction of a best rank-1 approximation yields identical real eigenvalues for Z2ZJ^^ is new 
and expands the knowledge of the topology of tensor rank. 

Numerical experiments yield the conjecture that for a generic real-valued p x p x 2 tensor X, 
subtracting its best rank-1 approximation Y results in Z = X — Y with Tj2^\^ having one pair of 
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identical real eigenvalues with only one associated eigenvector. Moreover, if the number of pairs of 
complex eigenvalues of X2X||~^ equals n, then Yi^^x^ has max(0, n— 1) pairs of complex eigenvalues. 
For n = 0, this implies that X has rank p and Z has rank p + 1 [T7] |34l lemma 2.2]. 

We also considered real symmetric 2x2x2 tensors. In Lemma 16.21 we provided a symmetric 
rank criterion via the eigenvalues of X2X||~^, which is similar to the asymmetric case. Symmetric 
tensors have rank 2 and 3 on sets of positive volume, and X2Xj^^ with distinct real eigenvalues 
implies A > and orbit G2, while X2Xj~^ with complex eigenvalues implies A < and orbit G3. 
When X2X^^ has identical real eigenvalues, it has A = and symmetric rank 3 (orbit -D3). The 
rank criteria of Lemma [6. 2 1 are equivalent to the well-known Sylvester Theorem for symmetric rank 
2. In Theorem 17.11 we showed that subtracting a best symmetric rank-1 approximation from a 
typical symmetric tensor yields a tensor in orbit D^, i.e. it has symmetric rank 3 and A = 0. This 
result is completely analogous to the asymmetric 2x2x2 case. 

A third case not reported here is that of 2 x 2 x 2 tensors with symmetric slabs, i.e. Xi2k = X2ik, 
k = 1,2. The rank-1 approximation problem is then 

minygR2^2gR2 ||X-y (»y ®z||^ (8.1) 

We can define a symmetric slab rank analogous to the symmetric rank and propose a rank criterion 
similar to Lemma [3.1l and Lemma [6. 2 1 Generic 2x2x2 tensors with symmetric slabs have symmetric 
slab ranks 2 and 3 on sets of positive volume. Moreover, a result analogous to Theorem 14.11 and 
Theorem 17.11 can be proven in this case. 
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Appendix A: Proof of Theorem 14.11 



We make use of the first part of Section 2. Let X be a generic 2x2x2 tensor with entries 



X 



a b 


e 


f 


c d 


9 


h 



(A.l) 



We consider the rank-1 approximation problem ()2.ip . It is our goal to show that, for the optimal 
solution Y = X ® y (8) z, we have X — Y in orbit D3. From the list of orbits in Table [H it follows 
that it suffices to show A(X — Y) = and X — Y has multilinear rank (2, 2, 2). We will do this by 
considering the stationary points of the rank-1 approximation problem. For later use, we mention 
that the hyperdeterminant ()3.2p of X in (jA.ip is given by 



A(X) = {ah -bg + de- cff -4 {ad- bc){eh - fg) . 



(A.2) 



We begin our proof by showing that for the best rank-1 approximation of X we have xi 7^ 0, 
X2 7^ 0, yi 7^ 0, y2 7^ 0, zi 7^ and 2^2 7^ almost everywhere. Due to the scaling indeterminacy in 
(x eg) y ® z), this implies that we may set yi = zi = 1 without loss of generality. 

Lemma A.l LefK. be a generic 2x2x2 tensor with a best rank-1 approximation x(8)y0z. Then 
xi 7^ 0, X2 7^ 0, yi 7^ 0, 2/2 / 0, zi 7^ 0, Z2 / almost everywhere. 

Proof. We show that zi 7^ almost everywhere. The proofs for yi, 2/2 and Z2 are analogous. The 
proofs for xi and X2 follow by interchanging the roles of x and y. 

Let the criterion ^ be as in (|2.6p and let ^'o denote (|2.6p with zi = 0. Then ^ < \I'o is equivalent 

to 

[{ayizi + by2Zi + eyiZ2 + /y2^2)^ + {cyizi + dy2Zi + gyiZ2 + hy2Z2f] > 

{z\ + zl) [(eyi + h2? + {gvi + hy2f'\ , (A.3) 
which, after setting yi = zi = 1, can be rewritten as 

(a^ + - - + 2z2 {ae + cg))+ 
2y2 {ah + cd — ef — gh + Z2 {af + be + ch + dg))+ 

yl{b^ + d^ - f -h^ + 2z2{bf + dh)) > 0. (A.4) 

Since (6/ + dh) 7^ almost everywhere, it is possible to choose Z2 such that the coefficient of 
is positive. Then there is a range of values 1/2 for which (|A.4p holds. This shows that, almost 
everywhere, we can find a better rank-1 approximation than setting zi = 0. This completes the 
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proof of zi / 0. □ 

As mentioned above Lemma lA.ll we set yi = zi = 1 without loss of generality. Since the optimal 
X is given by p.4p . the problem of finding a best rank-1 approximation of X is now a problem in 
the variables 1/2 and Z2 only. 

Next, we rewrite equations ()2.7p and ()2.8p specifying the stationary points (^2,-^2) as 

[{ef + gh) yl + (e^ + - f - h') y^ - (ef + gh)] + 

^2 [(0/ + be + ch + dg) y\ + 2 {ae + eg — bf — dh) 7/2 — (a/ + he + ch + dg)\ + 

[{ah + cd) yl + {a^ + -b^ - d^) y2 - (ab + cd)] = , (A.5) 



and 



zl [{bf + dh) yl + {af + be + ch + dg) 2/2 + {ae + eg)] + 
Z2 [{b" + d^-f- h^) yl + 2{ab + cd-ef- gh) y2 + {a^ + - e^ - g^)] + 

[- {bf + dh) yl - {af + be + eh + dg) y2 - {ae + eg)] = . ( A.6) 



Using the expression (j2.4p for x, also the hyperdeterminant (j3.2p of X — Y can be written as a 
function of {y2,Z2) only. After some manipulations, we obtain 

(1 + yl)\l + zlf A(X - Y) = [zl [{bg - de) yl + {ag - bh - ee + df) 2/2 + {cf - ah)] + 

Z2 [{be — ad + eh — fg) yl + (6c — ad + eh — fg)] + 
[{ah - cf) yl + {ag - bh - ce + df) 7/2 + {de - bg)] ] ^ . (A.7) 

Equations ()A.5|) and ()A.6p specifying the stationary points {y2, Z2), and the hyperdeterminant (|A.7p 
without the square, are of the same form: a polynomial of degree 4 in ?/2 and Z2 that is quadratic 
in both 1/2 and Z2. We use the result of the following lemma to compare the stationary points 
satisfying (jA.SP and (|A.6P to the roots of ()A.7p . 

Lemma A. 2 Let /(tt) = au^ + /5m + 7 and (/(u) = (5n^ + eii + z^ &e second degree polynomials. 
Then f and g have a common root if and only if 

{ae - 136) {I3v - £7) = (7.5 - av)'^ . (A.8) 



Moreover, if {-^5 — av) and (ae — (36) are nonzero, the common root is given by 

{jSu — £7) (7*5 — au) 
(7*5 — av) {ae — j3S) 



{AS 
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Proof. First, suppose / and g have a common root r. Then f{u) = a{u — r){u — ri) and 
g{u) = 5{u — r){u — r2) for some ri and r2- It fohows that 

/3 = — a(r + ri) e = —5{r + r2) ^ = arri v = 5rr2. (A. 10) 

Using these expressions, it can be verified that ()A.8|) holds, and r equals the expressions in ()A.9|) . 

Next, suppose ()A.8|) holds. Let f{u) = a{u — ri){u — r2) and g{u) = 5{u — r^){u — r^) for some 
'"1) ^"2) '"S) '"4- It follows that 

(3=—a{ri+r2) e = — (5 (rs + r4) 7 = Qrir2 = 5 rs r4 . (A. 11) 

Substituting these expressions into (jA.Sh and dividing both sides by q^J^ yields 

(n + r2 - rs - r4) (rir2 (rs + r^) - r^r4^ (n + r2)) = (rir2 - r3r4)^ . (A. 12) 

This can be rewritten as 

(n - r^){ri - ri){r2 - r^){r2 - r4) = , (A. 13) 

which implies that / and g must have a common root. As above, we have the expressions (IA.9P for 
the common root. □ 

Using Lemma IA.21 the stationary points (1/2,^2) are found as follows. Equations (jA.5p - (|A.6p rep- 
resent two quadratic polynomials in 1/2 that have a common root. Lemma IA.2I states that (jA.Sp 
must hold, where all coefficients are second degree polynomials in Z2- We rewrite this equation as 
pstat^^^^ _ where p^^^^ is a polynomial of degree 8. The 8 roots of are the Z2 corresponding 
to stationary points. For each Z2, the corresponding y2 is the common root given by ()A.9p . Hence, 
there are 8 stationary points (y2i -^2)1 and some of these may be complex. 

Instead of interpreting ()A.5P - (IA.6p as polynomials in y2^ we may interpret them as polynomials 
in Z2 with coefficients depending on 2/2- As above, the y2 of the stationary points are then found 
by finding the roots of an 8th degree polynomial Py^^^{y2) that is defined by (|A.8p . For each y2, 
the corresponding Z2 is the common root given by (|A.9p . Both ways of obtaining the stationary 
points necessarily yield the same result. 

Analogously, we may determine the points (7/2 1 Z2) satisfying (lA.Sp and having A(X — Y) = 
in (|A.7p . The same approach yields the points satisfying (IA.6P that are roots of (IA.7p . We denote 
the 8th degree polynomials corresponding to (jA.Sp and the roots of ()A.7p as Py^^^ and P"^^. We 
denote the 8th degree polynomials corresponding to ()A.6p and the roots of (IA.7P as Py^^"^ and P^^"^ ■ 
Using this approach, we obtain the following relation between the stationary points and the roots 
of ([XTl) . 
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(lA.Sp and (IA.6D 


yd) 


y(2) 


y(3) 


y{4) 


y{5) 


y{6) 


y{7) 


y(8) 




^(1) 


,m 


^(3) 


^(4) 


^(5) 


^(6) 




^(8) 


(IA.5P and root of (|A.7I) 


yd) 


y(2) 


y(3) 


y(4) 


y(5) 


y(6) 


y(9) 


y(10) 




^(1) 




z(3) 


^(4) 


z(5) 


^(6) 




^(8) 


(|A.6I) and root of (IA.7p 


yd) 


y(2) 


y(3) 


y(4) 


y(5) 


y(6) 


y(7) 


y(8) 




^(1) 


z(2) 


z(3) 


^(4) 


z(5) 


^(6) 


^(9) 


^(10) 



Table 3: Schedule of points (?/2, Z2) satisfying each pair of the equations (jA.Sp . ()A.6p . root of (lA.Tp . 
Equations (IA.5P - (|A.6p describe stationary points, while the roots of ()A.7p have A(X — Y) = 0. As 
can be seen, the points {y^^\ -z*-*^), i = 1, . . . , 6, satisfy all three equations. 



Lemma A. 3 The points {y2,Z2) satisfying two of the three equations \A.^ . <\A.(^ . root of ^A.l ), 
are related as specified in Table O In particular, 6 of the 8 stationary points are roots of pT?| ). 

Proof. Using symbolic computation software, it can be verified that 

Pt''\z2) ^ ^ Pf-\z2) ^ {eh - fg) zl + {ah - bg + de - of) Z2 + {ad - be) 

Pf^\z2) ' Pf^^ {Z2) {ad - be) zl - {ah - bg + de - cf) Z2 + {eh-fg)' 

PT'jy^) ^ 1 ^r'(^2) ^ {df - bh) yj - {ah + bg-de- cf) 7/2 + (ce - ag) 

P^-g2 ^y^^ ' peigi ^y^^ (eg _ y2 + + i,g_de- cf) y2 + {df-bh)- 

Hence, the roots Z2 of and PT^^ are identical, and so are the roots 2/2 of Py^^^ and Py^^"^ . 

Also, Pl**^* and Pz^^"^ have 6 of the 8 roots in common, as do P^*^* and Py^^^- This implies that 
the Z2-values of the stationary points coincide with the Z2-values of the points satisfying (jA.Sp that 
are roots of ()A.7p . Analogously, the 7/2-values of the stationary points coincide with the y2-values 
of the points satisfying (|A.6p that are roots of ()A.7p . Also, 6 of the Z2-values of the stationary 
points coincide with the 2;2-values of the points satisfying (1A.6P that are roots of ()A.7p . And 6 of 
the 7/2-values of the stationary points coincide with the y2-values of the points satisfying (lA.Sp that 
are roots of ()A.7p . 

In order to prove the relations in Table EJ it remains to show that the 6 common y2-values 
and the 6 common Z2-values form 6 common points {y2, 2^2) • Let Z2 be a root of and, hence, 

of P"^^. The corresponding 1/2 of the stationary point is the common root given by (lA.Op . The 
corresponding 7/2 of the point satisfying (|A.5p that is a root of (jAJl) is given by an analogous 
expression. Equating these two expressions for y2 yields an 8th degree polynomial in Z2 analogous 
to (IA.8p . We denote this polynomial as Using symbolic computation software, it can be 
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verified that 

^^^^^^2) ^ {eh - fg) zl + [ah - bg + de - cf) Z2 + {ad - be) 
Pz°'^{z2) (e/ + gh) zl + (a/ + be + ch + dg) Z2 + {ab + cd) ' ^ ' ^ 

Hence, p^^^^ and have 6 common roots. This imphes that 6 stationary points {y2, Z2) are also 
roots of (|A.7h . This completes the proof of the relations in Table [3l □ 

So far, we have shown that 6 of the 8 stationary points in the rank-1 approximation problem satisfy 
A(X — Y) = 0. In Lemma IA.4I below, we show that the two other stationary points {y^'^\ z^"^^) 
and {y^^\ z^^-*) correspond to x = in p.4p . which is not a best rank-1 approximation. The global 
minimum of the rank-1 approximation problem is thus attained in one of the stationary points 
z^*)), i = 1, . . . , 6. In Lemma lA.SI the proof of Theorem 14.11 is completed by showing that the 
multilinear rank of X — Y equals (2, 2, 2) for these stationary points. Together with A(X — Y) = 0, 
this implies that X — Y is in orbit D3. 

Next, we consider the two stationary points {y^'^\z^'^^) and {y'^^\ z^^'^). Note that y^"^^ and y^^^ 
are the roots of the numerator of (lA.lSp . y^^^ and y^'^^^ are the roots of the denominator of (jA.lSp . 
z^'^) and z*-^^ are the roots of the numerator of (IA.14p . and z*-^^ and z^^^-* are the roots of the 
denominator of (|A.14h . Moreover, these four polynomials of degree 2 have identical discriminant 
that is equal to the hyperdeterminant of X as given in (|A.2p . 

Hence, if A(X) < 0, i.e. X is in orbit G3, then the stationary points {y^'^\ z^'^'^) and {y'^^\z^^^) 
are complex. Since we only consider real-valued rank-1 approximations, we discard these two 
stationary points. If A(X) > 0, i.e. X is in orbit G2, we resort to Lemma |A.4[ 

Lemma A. 4 Suppose A(X) > 0. Then the stationary points {y^'^\z^'^^) and {y^^\ z(^)) in TablelE 
yield x = in ([j^.^p, and do not correspond to the global minimum almost everywhere. 



(A.17) 



Proof. It can be verified that y^"^^ and y*-*^^ are given by 



{ah + bg — de — cf) ± ^J {ah + bg — de — cf)"^ — 4(ci/ — bh){ce — ag) 

2 {df - bh) 

and z^'^^ and z^^^ are given by 



-{ah — bg + de — cf) ± {ah — bg + de — cf)"^ — 4(e/i — fg){ad — be) 



(A.18) 



2{eh-fg) 

where it is + in one stationary point and — in the other. Using symbolic computation software, it 
can be verified that the expression for x in (j2.4p is all-zero for {y^'^\z^'^^) and {y'^^\ z^^^). Hence, 
both stationary points yield the all-zero solution. This is not the global minimum since the solution 



x= , y= , z= , (A.19) 
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yields a lower criterion ^ in (|2.2|) when a ^ 0. This completes the proof. □ 

Lemma A. 5 For the stationary points (y^*^ = 1, . . . , 6, in Table the multilinear rank of 

X — Y equals (2, 2, 2) almost everywhere. 

Proof. Let Z = X — Y = X — x(g)y(g)z, where x is given by (|2.4|) . yi = zi = 1, and {y2, Z2) is 
a stationary point. If one of the frontal slabs Zi and Z2 of Z is nonsingular, then the mode-1 and 
mode-2 ranks of Z are equal to 2. Next, we show that det(Zi) = det(Z2) = corresponds to a set 
of measure zero. It can be verified that 

Z2 [-{de - hg) - {ag -hh-ce + df) y2 - {ah - cf) y\ + {ad - 6c)(l + 2:2] , . „„^ 
'^*(^^) = (l + yi)(l + .i) ' ^^-'^^ 



and 



(e/i - fg){\ + y|) + Z2 \-{ah - cf) + {ag - b h - ce + d/) y2 - {de - bg) yl] 

[l + yl){l + zl) 



det(Z2j = ^-1 , , 2\/i , 2\ • (A.2iJ 



Suppose det(Zi) = det(Z2) = 0, i.e. the numerators of the above expressions are zero. Since Z2 ^ 
almost everywhere (see Lemma lA.ip . we divide the numerator of det(Zi) by Z2- We then obtain 
two equations of the form Z2 = s{y2) /t{y2)- Equating both expressions for Z2 yields a fourth degree 
polynomial in 7/2 that can be written as 

[{ag - ce) yl - {ah + bg-cf- de) 2/2 + {bh - df)] [{df - bh) yl - {ah + bg-cf- de) y2 + {ce - ag)] = . 

(A.22) 

These two second degree polynomials are the numerator (times —1) and denominator of (jA.lSp . 
As explained above, the roots of these polynomials are complex if A(X) < 0. In this case, it is not 
possible to choose y2 and Z2 such that det(Zi) = det(Z2) = 0. When A(X) > 0, the sought values 
of y2 are y^^\ y^^^ and y^^^\ Therefore, in this case we may conclude that the points {y2,Z2) 
for which det(Zi) = det(Z2) = are not among the first 6 stationary points in Table [3] almost 
everywhere. 

Hence, the multilinear rank of Z equals (2,2,*). If one of the top and bottom slabs of Z is 
nonsingular, then also its mode-3 rank equals 2. A proof of this can be obtained analogous as above 
by interchanging the roles of x and z. This completes the proof. □ 
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Numerical examples 

Here, we illustrate the proof of Theorem 14.11 by means of two examples. We take two random X, 
one that has A(X) > (orbit G2) and one that has A(X) < (orbit G3). 
Our first example is 



X 



-0.4326 0.1253 
-1.6656 0.2877 



-1.1465 1.1892 
1.1909 -0.0376 



(A.23) 



We have A(X) = 2.7668. In the table below, we list the stationary points {1/2, Z2), their values of ^ 
in ()2.6p . their values of A(X — Y), and state whether their Hessian matrix is positive definite or not. 
Two of the stationary points (y*^*\ z^*^), i = 1, ... ,6, are complex. The remaining four points are 
the first four points in the table, and have A(X — Y) close to zero. The second point corresponds 
to the global minimum and is also found when computing a best rank-1 approximation to X via 
an alternating least squares algorithm. For Z = X ~ Y, the matrix Z2Z^^ has a double eigenvalue 
0.9185 with only one associated eigenvector. Lemma l3.ll implies that Z is in orbit D3. The last 
two points in the table are the stationary points {y^'^\ z^"^^) and {y^^\ z^^^). From Lemma lA.41 it 
follows that they have A(X - Y) = A(X) and ^ = ||X|p. 



2/2 Z2 


* A(X - Y) Hessian PD 


-0.592958 0.621735 
-0.229249 -1.08855 
2.22613 0.452035 
2.42488 -2.88759 


5.1164 1.4166e-12 no 
2.6863 9.6802e-13 yes 
7.1313 2.1210e-12 no 
6.5289 1.2999e-14 no 


1.17156 1.15843 
5.96728 -0.05296 


7.2081 2.7668 no 
7.2081 2.7668 no 



Our second example is 



X 



-1.6041 -1.0565 
0.2573 1.4151 



0.8156 1.2902 
0.7119 0.6686 



(A.24) 



We have A(X) = —2.7309. In the table below, we list the stationary points {1/2, Z2) in the same 
way as in the first example. Two of the stationary points (?/^*\ z*-*^), i = 1, ... ,6, are complex. 
Since A(X) < 0, the points {y^'^\ z^"^^) and {y^^\z^^^) are also complex. Hence, four real stationary 
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points are left, that all have A(X. — Y) close to zero. The first point in the table corresponds to 
the global minimum and is also found when computing a best rank-1 approximation to X via an 
alternating least squares algorithm. For Z = X — Y, the matrix T^2^\^ has a double eigenvalue 
1.6712 with only one associated eigenvector. Lemma |3 . 1 1 implies that Z is in orbit D3. 



2/2 Z2 


^ A(X-Y) Hessian PD 


0.995675 -0.598339 
-0.865475 0.0601889 
2.06437 1.78102 
-0.675154 9.24487 


3.1185 1.3801e-ll yes 
8.2319 1.5479e-13 no 
6.6050 1.6050e-13 no 
9.0028 2.6216e-13 no 



Appendix B: Proof of Theorem [TJJ 

We make use of the derivations in Section 5. Let X be a generic symmetric 2x2x2 tensor ()6.5p . 
We consider the symmetric rank-1 approximation problem ()5.ip . It is our goal to show that, for 
the optimal solution Y = y(X'y®y, we have X — Y in orbit D3 . From the list of orbits in Table [2l 
it follows that it suffices to show A(X — Y) = and X — Y has multilinear rank (2, 2, 2). We will 
do this by considering the stationary points of the symmetric rank-1 approximation problem. 
Let Y be as in (j7.2p . The stationary points are given by (j5.3p . which can be written as 

2/? + 2/i2/| + 2 2/?yi-2 6yiy2-a2/?-C2/i = 0, (B.l) 

yl^y\y2^ly\y\-lcy^y2-dyl-hy\ = 0. (B.2) 

Note that the entries a, h, c, d are nonzero almost everywhere. If one of yi and 7/2 is zero, it follows 
that both are zero almost everywhere. Since this corresponds to an all-zero Y, which is not the 
optimal solution, we may assume that yi 7^ and 2/2 7^ almost everywhere. 
Multiplying ()B.2p by yi and subtracting y2 times (IB.ip yields 

-hyl + {a- 2c) yly2 + (26 - d) yiyl + cyf = . (B.3) 

Defining z = 2/1/2/2 and dividing (jB.3p by y^-, we obtain 

-hz^ + {a-2c)z^ + {2h- d)z + c = Q. (B.4) 
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This yields three solutions for z = yi/y2, two of which may be complex. For each solution z, the 
corresponding stationary point (^1,1/2) satisfying ()B.ip - ()B.2p is given by 

az^ + 2bz + c bz'^ + 2cz + d 



yi = zy2, 



(B.5) 



where the latter equality is equivalent to (IB.4p . The polynomial (IB.4P determining the stationary 
points is also reported by O section 3.5]. 

Next, we consider the hyperdeterminant A(X — Y). For yi = zy2, we have 



X- Y 



a — z^ y2 h - z^ y2 
b- z^yl c- zyl 



b- z^yl c- zyl 
c- zyl d-yl 



(B.6) 



Using ()6.6p . we obtain 

/:^{'^-Y_) = + f{z)yl + {a-2,bz + 2,cz^ -dz^fyl, 

with 



(B.7) 



f{z) = [-4:b^ + Qabc-2a^d] + z [Qb^ c- 12 ac^ +Qabd] + z^ [Qbc^ -I2b^d + &acd] + z^ [Qbcd-2ad^] . 

(B.8) 

We substitute the second expression for yl in (1B.5|) into (IB.7|) and multiply by (z^ + 2z'^ + 1)^. 
Using symbolic computation software, it can be verified that this yields 



{-bz^ + (a - 2c) z^ + {2b-d)z + c) P{z) , 



(B.9) 



where P{z) is a 7th degree polynomial in z. By ()B.4p . the expression (IB. Oh is identical to zero. 
Hence, for all three stationary points {yi,y2), we have A(X — Y) = 0. 

In the final part of the proof, we show that X ~ Y has multilinear rank (2, 2, 2) almost every- 
where. Since the mode-n ranks of symmetric tensors are equal for each mode, it suffices to show 
that the two slabs of ()B.6p are nonsingular almost everywhere. Let Z = X — Y. We have 



det(Zi) = (ac-b^) + yl{-cz^ + 2bz^ - az) , 
det(Z2) = {bd-c^) + yl{-dz^ + 2cz-b). 

Hence, det(Zi) = det(Z2) = implies 

{bd- c^){cz^ -2bz^ +az) + {ac-b^){dz'^ - 2cz + 6) = 0, 

which can be written as 



(B.IO) 
(B.ll) 

(B.12) 



z^ [bed - e"] + z^ [2 bc^ + acd-3 b'^d] + z[2 b^c + abd - 3 ac^] + [abc - b^] = . 



(B.13) 



30 



Since the 3rd degree polynomials ()B.4p and ()B.13P do not have generically common roots, it fohows 
that at least one of the slabs Zi and Z2 is nonsingular almost everywhere. As explained above, 
this implies that X — Y has multilinear rank (2, 2, 2) almost everywhere. This completes the proof 
of Theorem 17. 1[ 



Appendix C: Orbits and G3 of real symmetric 2x2x2 tensors 



Here, we show that any real symmetric 2x2x2 tensor X in orbit -D3 or G3 is related to the 
canonical form Y of the orbit by an invertible multilinear transformation (S, S, S) • Y = X. 

First, we consider orbit D3, which is defined by symmetric rank 3, multilinear rank (2,2,2), 
and hyperdeterminant A = 0. It follows from the proof of Proposition 16.31 that we may assume 
without loss of generality that X in orbit D-^ has the form 



a 1 


1 


1 


1 1 


1 


d 



X 



with 

A(X) = a^d^ - 6ad + 4a + 4d - 3 = . 
Our goal is to find a nonsingular 

Sl S2 

S3 S4 

such that (S, S, S) • Y = X, where the canonical form Y of orbit Dy, is given in Table [2l i.e. 



(S,S,S) 



This yields the following four equations: 



o 2, 

6S1S2 = a. 



SiSi + 2S1S2S3 = 1 






1 


1 







a 1 


1 


1 


1 













1 1 


1 


d 



3 s|s4 = d , 



S2s| + 2S1S3S4 = 1 



(C.l) 



(C.2) 



(C.3) 



(C.4) 



(C.5) 
(C.6) 



Note that the case a = d = \ has A = but yields multilinear rank (1, 1, 1) and, hence, is not 
included in orbit D^. The case a = 0,d = 3/4isin orbit D3 and its solution of ()C.5P - (|C.6h is 



1 
1/2 1 



(C.7) 



31 



The case a = 3/4, d = can be treated analogously. In the remaining part of the proof we assume 
a 7^ and d 7^ 0. This implies that all entries of S are nonzero. From ([02]) it follows that 

3a-2±2(l-a)^/r^ 



d 



(C.8) 



Hence, we must have a < 1. Since (|C.2p is symmetric in a and d, also d < 1 must hold. 

Next, we solve the system (|C.5P - ()C.6p . From ()C.5p we get S2 = a/(3sf) and S4 = d/{3s1). 
Substituting this into (|C.6p yields, after rewriting, 

d f si\^ f si\ 2a d f si\^ 

We equate the right-hand sides of (|C.9p , which yields 



3 KsJ 



1 (si 

2 \S3 



a 
6 



(C.9) 



S3, 



a . 



(C.IO) 



(C.ll) 



Substituting this into one equation of ()C.9p gives us 

3-2a±3vT^ 

(i±^r^)3~' 

It can be verified that this expression for d is identical to (jC.Sp . Hence, equation (|C.10p . together 
with S2 = a/(3sf) and S4 = d/(3s|), solves the system (|C.5p - (|C.6p . Note that since both tensors 
in (|C.4p have multilinear rank (2, 2, 2), it follows that S is nonsingular. Hence, we have shown that 
for any X in orbit there exists a nonsingular S such that (S, S, S) • Y = X, where Y is the 
canonical form of orbit D^. 

Next, we consider orbit G3, which is defined by symmetric rank 3, multilinear rank (2,2,2), 
and hyperdeterminant A < 0. As above, we may assume that X in G3 has the form (jC.ip with 



A(X) = a^d^ _ 6ad + 4a + 4d - 3 < . 
It is our goal to find nonsingular S in (|C.7p such that 

(S,S,S)- 



(C.12) 



-1 





1 




a 1 


1 


1 


1 


1 







1 1 


1 


d 



(C.13) 



where the former tensor is the canonical form of orbit G3 as given in Table [2l This yields the 
following four equations: 



o 9 

S;^ + 3 S1S2 = a , 



S^Ss + 2 S1S2S4 + s|s3 = 1 , 



-si + 3S3S4 = d, 



-sisl + 2 S2S3S4 + S1S4 = 1 , 



(C.14) 
(C.15) 
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The case a = 0, d < 3/4 has sohition si = 0, S3 = 3/4 — d > 0, = l/ss, S4 = l/(4s3), with 
det(S) = — 1/S3 < 0. The case a < 3/4, d = can be treated analogously. In the remaining part of 
the proof we assume a 7^ and d ^ 0. This implies that si and ,53 are nonzero. Note that a = 1 
implies A = {d — 1)^, which is not in orbit G3. Analogously, d = 1 is not in orbit G3 either. In 
fact, A < implies a < 1 and d < 1. 

Next, we solve the system ()C.14p - (jC.15p . Expressions for S2 and 54 are obtained from (|C.14p as 

si =4^, si =4^. (C.16) 

3si 3 S3 

Equations dClTK]) can be written as 

1 + sf S3 - s|s3 l + sis|-sis| 
2 S2S4 = , 2 S2S4 = ^ . {^■^') 

Sl S3 

Equating the right-hand sides and substituting (IC.16P yields, after rewriting, 

The discriminant of this 3rd degree polynomial equals — 27A > 0, which implies that (IC.lSh has 
three distinct real roots. Let si = as3, where the root a satisfies 

da^ = 3a^ -3a + a. (C.19) 

Substituting si = a S3 and (|C.16p into the first equation of (|C.15p yields 

4sl{da'^ + 2aa-3a^) = 9 - 6a/a + /a^ - Aada . (C.20) 

Using (fCTTO]) . this can be rewritten as 

a{3-dar-Aad 
^ 12(a2_2a + a) ^ ^ 

It remains to verify that the expressions (1C.16P are nonnegative. Our proof is tedious and long. 
Below, we give a summary of it. The full proof is available on request. 

Substituting ()C.2ip and using (IC.lOp . it can be shown that the expressions ()C.16p are nonneg- 
ative if 

Pi{a) = a'^{4:-3d)+a{ad-3) + a>0, P2{a) = -da^ + a{3 - ad) - a > Q . (C.22) 

Note that Pi + P2 = 40;^ (1 — d) > 0. Also, the leading coefficient of Pi is always positive. The 
roots of Pi are given by 

3 -ad- ^a^d^ -h 6ad - 16a 9 3-ad^ ^a^d^ -h 6ad - 16a -h 9 ,^ , 

= ' = ■ ^ ^ ^ 
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The roots of P2 are given by 

ad -2,- \/a?d? - IQad + 9 ad - ?, + ^/^i^W^^miI+9 
= =2d ' = =2d • ^^-'^^ 

Let P-i{x) = d — 2) + 2> X — a. To prove (|C.22p . we focus on the sign of P3 in the roots ^"1, ^2, rs, r4. 

When the discriminant of Pi is nonnegative, we can distinguish three cases. In these cases, the 

sign of P3 in the roots ri and r2 is as fohows: 

case I : a > and a^d > -3o + 4a^/a Psin) > P3(r2) > , (C.25) 

case II : a > and a'^d < -3a - 4a^/a Psin) < P3(r2) < , (C.26) 

case III : a < ^ P^in) > P3(r2) > . (C.27) 

When the discriminant of P2 is nonnegative, we have 

{> i(d>0, 
(C.28) 
< if d < . 

Suppose d > 0. Then the leading coefficient of P2 is negative and its discriminant is positive (since 
a < 1 and d < 1). Hence, P2 has real roots. Recall that the leading coefficient of Pi is always 
positive. Suppose the roots of Pi are real. Then we are in case I or case III (since case II implies 
d < 0). Since Pi + P2 > 0, there must hold u < n < r2 < r^^. From ([025]), and ({(128]) . 

it follows that P3 has a root a in the interval [r4,ri] for which (IC.22P holds. If the roots of Pi are 
not real, then (IC.28P implies that P3 has a root a in the interval [r4,r3] for which (1C.22() holds. 

Suppose next that d < 0. Then the leading coefficients of Pi and P2 are positive. Suppose 
Pi and P2 both have real roots. Since Pi + P2 > 0, there must hold either r^, < < ri < r2 or 
fi < r2 < < r^. From (IC.25h - (IC.28|) we obtain the following. Suppose we are in case I or case 
III. If ^3 < r4 < ri < r2, then P3 has a root a in the interval [?'4,ri] for which (1C.22P holds. If 
fi < r2 < r-^ < r4^, then P3 has a root a in the interval [r2,r3] for which ()C.22|) holds. Suppose we 
are in case II. Then P3(r3) < and P3(?'i) < 0. From the shape of P3 it follows that it has a root 
a < r3 if r3 < r4 < ri < r2, or a root a < ri if ri < r2 < rs < r4. In both situations, we have 
(|n?22]) for this root a. 

When d < and Pi does not have real roots, ()C.28p implies that P3 has a root a < for which 
(|C.22p holds. When d < and P2 does not have real roots, (|C.25p - (|C.27p imply that P3 cannot 
have all three roots in the interval [ri,r2]. Hence, there exists a root a for which (jC.22p holds. 
Finally, it can be shown that Pi and P2 cannot both have complex roots when a < 1 and d < 1. 

Hence, we have shown that the system ()C.14p - (jC.15p is solved by (|C.16p . (|C.2ip . and si = a S3, 
where a is a root of P3 satisfying ()C.22p . In numerical experiments we found that any root of P3 
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satisfies ()C.22p . Note that since both tensors in ()C.13P have multilinear rank (2,2,2), it follows 
that S is nonsingular. Hence, we have shown that for any X in orbit G3 there exists a nonsingular 
S such that (S, S, S) • Y = X, where Y is the canonical form of orbit G3. 
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